Noise and room acoustics distorted speech recognition by HMM composition
نویسندگان
چکیده
This paper presents a robust speech recognition method based on the HMM composition for the noisy room acoustics distorted speech. The method realizes an improved user interface such as the user is not encumbered by microphone equipments. The proposed HMM composition is obtained by naturally extending the HMM composition method of an additive noise to that of the convolutional room acoustics distortion. The HMM composition is conducted by 2 steps: 1)Composition of HMMs of a speech and acoustical transfer function in cepstrum domain, 2)Composition of distorted speech and noise HMMs in linear spectral domain. The speaker dependent/independent word recognition experiments are carried out using the speech database contaminated by the additive noise and convolutional room acoustics distortion. The evaluation experiments are also conducted for unknown testing sound source positions. These results clarified the effectiveness of the proposed method.
منابع مشابه
Improved HMM Separation for Distant-Talking Speech Recognition
In distant-talking speech recognition, the recognition accuracy is seriously degraded by reverberation and environmental noise. A robust speech recognition technique in such environments, HMM separation and composition, has been described in [1]. HMM separation estimates the model parameters of the acoustic transfer function using adaptation data uttered from an unknown position in noisy and re...
متن کاملSpeech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
متن کاملHMM-separation-based speech recognition for a distant moving speaker
This paper presents a hands-free speech recognition method based on HMM composition and separation for speech contaminated not only by additive noise but also by an acoustic transfer function. The method realizes an improved user interface such that a user is not encumbered by microphone equipment in noisy and reverberant environments. The use of HMM composition has already been proposed for co...
متن کاملModel Adaptation Based on Hmm Decomposition for Reverberant Speech Recognition
The performance of a speech recognizer is degraded drastically in reverberant environments. We proposed a novel algorithm which can model an observation signal by composition of HMMs of clean speech, noise and an acoustic transfer function(l]. However, how to estimate HMM parameters of the acoustic transfer function is a remaining serious problem. In our previous paperll], we measured real impu...
متن کاملModel adaptation based on HMM decomposition for reverberant speech recognition
The performance of a speech recognizer is degraded drastically in reverberant environments. We proposed a novel algorithm which can model an observation signal by composition of HMMs of clean speech, noise and an acoustic transfer function[1]. However, how to estimate HMM parameters of the acoustic transfer function is a remaining serious problem. In our previous paper[1], we measured real impu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996